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TARGETED MOLECULAR BAR CODES AND 
METHODS FOR USING THE SAME 
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CROSS-REFERENCE TO RELATED APPLICATIONS 
Pursuant to 35 U.S.C. § 1 19 (e), this application claims priority to the filing date of 
the United States Provisional Patent Application Serial No. 60/1 1 1,802 filed December 11, 
1998 and United States Provisional Patent Application Serial No. 60/158,020 filed October 6, 
1999, the disclosures of which are herein incorporated by reference. 

INTRODUCTION 

Field of the Invention 

The field of this invention is analyte detection, particularly labels employed in analyte 
detection. 

Background of the Invention 

Analyte detection, in which a sample is assayed for the presence of a particular 
analyte of interest, is critical to a variety of fields. For example, analyte detection protocols 
are employed in the characterization of complex mixtures, the identification and 
characterization of novel compounds of interest, and the like, both in industry and academia. 
Analyte detection also plays a role in medicine, particularly in the diagnosis of disease. In 
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such applications, a sample from a patient is assayed for the presence of one or more specific 
analytes indicative of the disease being diagnosed. Environmental monitoring applications, 
such as the monitoring of pollutants and toxins, also employs analyte detection protocols. 

Where an analyte of interest is not readily measured directly, the use of a detectable 
label is required. A variety of detectable labels have been developed over the years for use in 
analyte detection. These include both directly and indirectly detectable labels. Directly 
detectable labels include radioisotopes, enzymes, fluorescent and chemiluminescent agents, 
or other labels capable of being directly detected . Where an indirectly detectable label is 
employed, it is generally a member of a signal producing system such as a second stage 
antibody or reagent that is used to amplify the signal. Such reagents are well known in the 
art. For example, the primary antibody may be conjugated to biotin, with horseradish 
peroxidase-conjugated avidin added as a second stage reagent. Alternatively, the secondary 
antibody may be conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, Texas 
red, etc. Final detection uses a substrate that undergoes a color change in the presence of the 
peroxidase. 

Despite the wide variety of different labeling agents that have been developed over 
the years for use in analyte detection, there is still an interest in developing new ones. Of 
particular interest would be the development of sensitive labels that can simultaneously detect 
small quantities of analytes present in complex mixtures by methods that do not require 
preparation steps prior to detection. 
Relevant Literature 

U.S. Patents of interest include: 5,573,905 and 5,723,598. Also of interest are Akespn 
et al., "Microsecond time-scale discrimination between polycytidylic acid, polyadenylic acid 
and polyuridylic acid as homopolymers or as segments within single acid as homopolymers 
or as segments within RNA molecules" Biophys.J (1999) 77:3227-3233; Wonderlin et al., 
"Optimizing planar lipid bilayer single-channel recordings for high resolution with rapid 
voltage steps" Biophys. J. (1990) 58:289-297; and Kasianowicz, et al, "Characterization of 
individual polynucleotide molecules using a membrane channel," Proc. Natl. Acad. Sci. USA 
(1996) 93: 13770-13773. See also, Benoit et al, J. Am. Chem. Soc. (1999) 121:3904; 
Hawker et al. Paper 136, PMSE Division, 217 th ACS Meeting, Anaheim (1999); Hawker et 
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al., Acc. Chem. Res. (1997) 30:373; Dao et al., J. Polymer Science Part A-Polymer 
Chemistry (1998) 36:2161-2167; Keck et al., J. Org. Chem. (1996) 61:359. 

SUMMARY OF THE INVENTION 
Targeted molecular bar codes and methods for their use in analyte detection are 
provided. The subject targeted molecular bar codes comprise a molecular bar code bonded to 
a member of a specific binding pair, usually through a linking group. The molecular bar 
codes are charged polymeric compounds, preferably negatively charged compounds, e.g. 
negatively charged block copolymers, that can be detected by a nanopore. The subject 
targeted molecular bar codes find use in a variety of different analyte detection applications, 
including screening and diagnostic applications. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 provides a schematic diagram of a targeted molecular bar code. 
Figure 2 provides a schematic of a nanopore device that may be used to examine 
molecular bar codes. 

Figure 3 provides an example of a current blockade profile caused by a molecular bar 
code. Specifically, Figure 3 provides an example of a current blockade profile caused by a 
molecular bar code. A single a-hemolysin channel was inserted into a bilayer giving an open 
current of 126 pA at 120 mV in 1 M KC1 buffer. During control measurements in the 
absence of a bar code, virtually no blockades were observed. Addition of A( 30 )C(70)Gp bar 
code to the cis bath immediately produced blockade events at a rate of approximately 1 per 
second. In this experiment, most bilevel events first exhibited 5 pA residual current (95 
percent current blockade, solid arrow) followed by a 19 pA residual current (84 percent 
blockade, dashed arrow). 

Figure 4 provides a representation of a general approach using targeted molecular bar 

codes. 

Figure 5 shows the current fluctuations caused by a polydT(18) bar code. Comparison 
between unimpeded current in a nanopore and current fluctuations cause by a polydT(18) bar 
code. The tope tracing (A) shows unimpeded current through a single a-hemolysin channel at 
120 mV in 1 M KC1. The lower tracing (B) shows blockades of the same channel current by 
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polydT 18 mers at 20 concentration. At this concentration, the blockade frequency was 
about 50 per second. The duration of the blockades averaged about 50 microseconds. 

Figure 6 shows the nanopore current blockades characteristic of a polyphosphate bar 
code and a polyC bar code. Nanopore current blockades characteristic of a polyphosphate bar 
5 code and a polyC bar code are shown. (A) Typical blockades caused by a nominal 65 mer of 
polyphosphate. The average amplitude of these blockades was 30 pA relative to the 
unimpeded channel current. (B) Typical blockades caused by a polycytidylic acid 125 mer. ' 
The average amplitude of these blockades was 1 ] 5 pA. These two examples show the ca.85 
pA range of blockade amplitudes that can be built into individual bar code blocks. This is 
10 seven-fold greater than the 12 pA difference that has been shown to be discernible within 
single KN A strands using a commercial patch clamp amplifier and the horizontal thin film 
device. 

Figure 7. Current profile analysis demonstrating that a nanopore device can 
discriminate between different bar codes in a mixture, (a) is polyA; (b) is polyC and (c) is 
15 poIyA+polyC. 

Figure 8. Current profile analysis for a bar code comprised of polydT oligomers 
linked to polyethylene glycol (CI 8) spacers. Specifically, this figure shows the event pattern 
caused by a block copolymer comprised of polydT DNA oligomers linked via phosphodiester 
bonds to non-DNA polyethylene glycol (CI 8) spacers. This molecular bar code 
20 (polydT (33) /PEGl 8/dT (t) /PEGl 8/polydT (3 4)) was synthesized on an ABI 392 synthesizer using 
conventional phosphoramidite chemistry. 

Figure 9 provides a schematic diagram of the protocol used to detect N-ras, as 
described in the Experimental Section, Infra. 

Figure 10. Current profile analysis of N-ras bar code. 
25 Figure 1 1 . Comparison of blockade durations for a) polydC oligomer with an internal 

10 bp hairpin and b) a polydC oligomer with the same nucleotide composition but in random 
order. 

Figure 12. Blockade duration as a function of number of base pairs in hairpins 
inserted into polydC oligomers. The upper line (dark diamonds) represents the duration mode 
30 for oligomers with base-paired hairpins. The lower line (open circles) represents the duration 
mode for oligomers with the same nucleotide composition but randomized. 
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Figure 13. Discrimination between poiydC oligomers with 10 base-pair hairpins and 
oligomers with randomized sequence based on blockade duration. Each population includes 
approximately 1000 blockade events. 

Figure 14. A model molecular bar code containing two 6 base-pair hairpins designed 
to control the rate of translocation through the nanopore. 

Figure 15. Current blockades caused by the oligonucleotide diagrammed in Figure 14. 
The arrows indicate an abrupt current increase that correlates with occupancy of the pore by 
an abasic segment built between poly dC segments. 

Figure 16. Schematic of living polymerization reaction using Braslau/Vladimir 
initiator. 

Figure 17. Representative hydrophilic monomers that may be used in the molecular 
bar codes. 

Figure 18. Representative motifs in copolymers of the subject invention. 
Figure 19. Charged or ionizable monomers that may be employed in the subject 
molecular bar codes. 

Figure 20. Derivitizable monomers that may be employed in the subject molecular bar 

codes. 

Figure 21 Motifs of different branched polymers that may be employed in the subject 
bar codes. 

Figure 22. Schematic for preparation of functionalized nitroxide Braslau/Vladimir 
initiator. 

Figure 23. Schematic for preparation of secondary hydroxy group at the terminus of a 
bar code. 

DETAILED DESCRIPTION OF THE INVENTION 
Targeted molecular bar codes and methods for their use in analyte detection are 
provided. The subject targeted molecular bar codes include a molecular bar code stably 
associated with, e.g. covalently attached, typically through a linking group, to a member of a 
specific binding pair ( see Figure 1). The molecular bar code is generally a charged polymeric 
compound, and in preferred embodiments, is a negatively charged or anionic compound, e.g. 
a negatively charged block copolymer. The subject targeted molecular bar codes find use in a 
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variety of different analyte detection applications, including screening and diagnostic 
applications. In further describing the subject invention, the targeted molecular bar codes will 
be described first followed by a discussion of methods for their use in analyte detection. 

Before the subject invention is described further, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, as 
variations of the particular embodiments may be made and still fall within the scope of the 
appended claims. It is also to be understood that the terminology employed is for the purpose 
of describing particular embodiments, and is not intended to be limiting. Instead, the scope of 
the present invention will be established by the appended claims. 

In this specification and the appended claims, the singular forms "a," "an," and "the" 
include plural reference unless the context clearly dictates otherwise. Unless defined 
otherwise, all technical and scientific terms used herein have the same meaning as commonly 
understood to one of ordinary skill in the art to which this invention belongs. 

A critical feature of the targeted molecular bar code of the subject invention is the 
molecular bar code. The molecular bar code is a polymeric molecule that is capable of being 
translocated through a nanopore under the influence of an applied electric field (see Figure 
2). Importantly, during translocation of the molecular bar code through the nanopore, a 
reproducible signal is produced, for example a current blockade profile ( see Figure 3). Other 
signals may include fluorescence, chemiluminescence, or variations in tunneling current. By 
current blockade profile is meant the collection of current blockade data points plotted versus 
a given period of time upon application of an applied electric field to a nanopore. 
Components of the profile that may be employed include one or more of amplitude, duration 
and pattern, where usually two or more of these characteristics are detected when looking at 
the current blockade profile. The given period of time that a single bar code molecule is 
examined is generally at least about 10 microseconds, usually at least about 100 
microseconds and more usually at least about 250 microseconds, and may be as long as 10 
milliseconds or longer, but will usually not exceed about 5 milliseconds in length. The 
current blockade data points are derived from the observed change in ionic current through 
the nanopore from the cis to the trans side upon occupancy by the polymeric molecular bar 
code. Suitable current measurement devices, as well as hardware and software necessary for 
generating the current blockade profile from the observed changes in current, are well known 
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to those of skill in the art and a representative system is disclosed in the Experimental 
Section, infra. 

The polymeric molecular bar code is sufficiently long to generate a distinctive, 
reproducible signature or current blockade profile such that the molecular bar code provides a 
5 distinctive tag for the specific binding member with which it is associated, (e.g. covalently 
attached) in the targeted molecular bar code. As such, the length of the polymeric molecular 
bar code is at least about 10 nm, usually at least about 20 am and more usually at least about 
50 nm, where the length may be as long as 1000 nm or longer, but will usually not exceed 
about 500 nm and more usually will not exceed about 400 nm. The polymeric molecular bar 

10 code is either a linear or non linear, e.g. branched polymer, where non-linear polymers of 
interest include comb polymers, star polymers, and dendritic polymers. In many 
embodiments, the polymer is generally a linear molecule. The linear polymeric molecule may 
or may not assume a secondary structure under the conditions in which it is employed, e.g. 
physiologically acceptable aqueous solutions, where the secondary structure may be a helical 

15 configuration, a loop, e.g a hairpin loop, etc. The only limitation with respect to the physical 
dimensions of the molecular bar code is that it be able to assume a configuration that allows it 
to pass or translocate lengthwise through the nanopore from the cis side to the trans side. 

The polymeric molecular bar code is generally a charged molecule such that it moves 
under the influence of an applied electric field. While in the broadest sense the molecular bar 

20 code may be positively or negatively charged, it is generally negatively charged. In other 
words, the molecular bar code is typically an anionic polymeric molecule. 

The molecular bar code may be a homopolymer or a copolymer of two or more 
different types of monomers. As indicated above, the only limitation on the molecular bar 
code is that it generate a distinct and reproducible signal upon translocation through a 

25 nanopore, as described in greater detail infra. Block copolymers of interest include polymers 
of 2 or more up to 10 or more different types of monomeric units, where copolymers of 
interest include random, gradient and block copolymers. 

In many embodiments of interest, the molecular bar code is a block copolymer in 
which a plurality of blocks are covalently bonded to each other in sequential fashion. By 

30 plurality of blocks is meant at least 2, where the block copolymer will generally have from 2 
to 20 blocks, usually from 2 to 10 blocks and in many cases from 3 to 10 blocks. In the 
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molecular bar codes, the number of unique blocks may be the same as the total number of 
blocks in the copolymer or it may be less, e.g. in molecular bar codes in which a given unique 
block is repeated one or more times. In many preferred embodiments, the total number of 
distinct blocks in the copolymer is less than the total number of blocks in the polymer. In 
these preferred embodiments, the total number of distinct blocks in any given copolymer will 
generally range from 1 to 6, and usually from 2 to 4, with 3 distinct blocks being found in 
many molecular bar codes of particular interest. 

Each block in the polymer may be heteropolymeric or homopolymeric. As such, each 
block in the polymer may be made up of the same monomeric units or a plurality of different 
monomeric units. Where the block is a heteropolymer made up of a plurality of monomeric 
units, the number of different monomeric units is at least 2, and may be as high as 5 or 7 or 
more, but is generally not more than 4. In many embodiments, the heteropolymeric block is a 
polymer of 2 or 3 different monomeric units. 

The length of each block within a molecular bar code will be sufficient to generate a 
reproducible current blockade or other signal as the block translocates through a nanopore 
under an applied electric field. As such, the length of each block will generally range from 
about 1 or 5 to 50, usually from about 5 or 10 to 30 and more usually from about 10 or 15 to 
20 or 25 nm, where in many embodiments the length will be about 15 to 20 nm. Depending 
on the nature of the monomeric units of the block, the number of monomeric units in a 
particular block may vary, where the number of monomeric units in a block will range from 
about 1 to 100, usually from about 3 to 60 and more usually from about 5 to 40. 

In many preferred embodiments, the block(s) of the negatively charged or anionic 
polymeric bar codes are homopolymers of monomeric units that include a phosphate group 
(or phosphorothioate group), where the phosphate group (or phosphorothioate group) is not a 
pendant group but a member of the polymeric backbone of the bar code, i.e. the backbone of 
the polymer includes the presence of phosphodiester bonds. Of particular interest as 
monomeric units are phosphates or sugar phosphates. When the monomeric units are sugar 
phosphates, the sugar moiety of the monomeric unit will generally be from 4 to 6 carbon 
atoms, where the carbon atoms will generally form a cyclic structure, e.g. a pentose or 
hexose. Of particular interest are 6 carbon sugars forming a five-sided ring structure, 
specifically ribose and deoxyribose. The monomeric units may further include one or more 
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pendant groups, such as heterocyclic nitrogenous bases, e.g. purines and pyrimidines. 
Specific purines of interest include natural ly occurring and synthetic purines, where naturally 
occurring purines such as adenine and guanine, specifically adenine, are of particular interest. 
Likewise, specific pyrimidines of interest include naturally occurring and synthetic 
5 pyrimidines, where naturally occurring pyrimidines such as cytosine, thymine and uracil, 
particularly uracil and cytosine, are of particular interest. In many embodiments, the purines 
and pyrimidines are modified to minimize hydrogen bonding between pairs, e.g. Watson- 
Crick base-pairing is minimized. 

The block(s) of the molecular bar code are, in many embodiments, homopolymers of 
10 monomeric units having the structural formula: 



15 



20 



25 



30 




wherein: 

R is H or a heterocyclic nitrogenous base, e.g. a heterocyclic nitrogenous base that 
may or may not be capable of participating in Watson-Crick base pairing interactions, such as 
a purine or a pyrimidine; 

Y is H or OH; 

nisOorl; 

m is an integer ranging from 10 to 80; and 
qisOorl. 

Specific polymeric blocks of interest include: polycytidylic acid, polyadenylic acid, 
polyuridylic acid, polyphosphate, oligodeoxyribose phosphate (i.e. abasic polydeoxyribose 
phosphate), polyethylene glycol-phosphodiesters, oligonucleotides (including modified 
oligodeoxynucleotides such as oligo-5-nitroindole that do not form Watson-Crick base pairs), 
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polystyrenes, polyacrylic acids, polyacrylates, polyacetonitriles, polyvinylphosphonates, 
polyacetamides and the like. Of particular interest in certain embodiments are molecular bar 
codes that are fabricated from polymers with biologically inert backbones, where such 
polymers include polystyrenes, polyacrylic acids, polyacrylates, polyacetonitriles, 
polyvinylphosphonates, polyacetamides and the like. 

In many embodiments, the polymeric compound that is the molecular bar code is not 
a deoxyribonucleic acid in which all of the bases are selected from the group containing 
adenine, cytosine, thymine, and guanine bases. For example, where one of the blocks of a 
copolymer is an oiigodeoxyribonucleotide comprised of one of these bases, the molecular bar 
code will usually comprise at least one additional non-oligodeoxyribonucleotide, such as 
polyacrylic acid, polystyrene, polyacetamides, polyphosphate, oligodeoxyribosephosphate, 
etc. Accordingly, the entire molecular bar code will not be a deoxyribonucleic acid in these 
particular embodiments. 

To further modify the properties of the molecular bar code, e.g. the profil e of the bar 
code generated upon translocation through a nanopore, a number of different types of 
elements can be incorporated into the bar codes. For example, secondary structures may be 
built into the bar code, as mentioned above. Of particular interest in certain embodiments are 
hairpin structures, which may be built into the bar code to modulate the profile generated by 
the bar code, extend the translocation time of the barcode through the pore, etc. 

Hairpin structures form when a single polymer contains regions of complementary 
sequence separated by several non-complementary units. The polymer strand folds on itself 
so that the complementary regions associate, forming the stem of the hairpin. The non- 
complementary units connecting the complementary regions form a loop. Hairpin stem 
structures of interest vary in length from about 1 to 24 bp, but typically range from about 2 to 
12 and usually from about 2 to 1 0 bp in length, As such, the length of the polymeric stretch 
that gives rise to the hairpin under the conditions of use of the bar code typically ranges in 
certain embodiments from about 36 to 94 monomers, usually from about 36 to 68 monomers 
and more usually from about 56 to 64 monomers; while in other embodiments from about 6 
to 52 monomers, usually from about 8 to 28 monomers and more usually from about 8 to 24 
monomers. The polymeric stretch or domain may be made up of any sequence of monomeric 
residues, where the residues may be naturally occurring or synthetic residues (i.e. non- 
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naturally occurring residues) so long as a portion of the residues, generally the residues at 
either end of the polymeric domain, are capable of hybridizing to each other analogous to a 
Watson-Crick base pair type interaction. In other words, the polymeric domain that gives rise 
to the hairpin loop has two flanking regions that are complementary to each other, where the 
length of each complementary flanking region in terms of monomeric units ranges in length 
from about 1 to 24 units, usually from about 2 to 12 units and more usually from about 2 to 
10 units. As such, the residues are typically nucleotides, either naturally occurring or 
synthetic. Separating the flanking residues that participate in the base pairing interaction 
described above to form the hairpin loop structure is a domain of residues that do not 
participate in hybridization but serve as a spacer group. Generally, the length of this spacer 
group in terms of monomeric units ranges from about 1 to 6 units, usually from about 2 to 5 
units, and more usually from about 3 to 4 units. 

Specific bar codes of interest include: polycytidylic acid, polyadenylic acid, 
polyuridylic acid, polyA/polyC block copolymers, polyC/polyU block copolymers, polydT 
comprised of 5 to 150 monomers, random DNA oligonucleotides, abasic deoxyribose- 
phosphate polymers, DNA or non-DNA polymers containing base-paired hairpins, DNA 
hairpins, polyacrylic acids, polyacetamides, polyacrylates, and polystyrenes. 

Stably associated with, and preferably covalently attached to, the molecular bar code 
of the targeted molecular bar code, is the second requisite moiety-the member of a specific 
binding pair. The specific binding pair member may be attached either directly to the 
molecular bar code or, preferably, through a linking group, more particularly a cleave-able 
linking group. By cleave-able linking group is meant that the linking group can be cleaved 
under conditions that do not disrupt, e.g. hydrolyze, the molecular bar code but serve to 
release the entire molecular bar code from the specific binding pair member of the targeted 
molecular bar code. Preferably, linkers are those that can be incorporated into the synthesis of 
the barcode, even where automated synthesis protocols are employed. The linking group 
may be a variety of different moieties, where suitable groups include: disulfide groups, 
restriction sites, photocleavable groups, and the like. Chemically cleavable linkers of interest* 
are those that include disulfide groups, dimethoxytrityl groups, etc. Restriction sites of 
interest that may may be present in the linking groups include. EcoR I, Sma I, Kpn I, Dra I 
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and the like. Photocleavable groups of interest include: vanillin based photolabile linkers, and 
the like. 

The specific binding pair member may be any moiety capable of specifically 
interacting with a second moiety, e.g. an analyte of interest. As such, the specific binding pair 
member of the targeted molecular bar code may be a ligand or receptor, an antibody or 
binding fragment thereof, e.g. Fv, F(ab ) 2 and Fab, an antigen recognized by a circulating 
antibody and the like; a nucleic acid, e.g. oligonucleotide, etc., where the nature of the 
specific binding pair member will depend on the nature of the analyte of interest, e.g. whether 
the analyte is a receptor or other cell surface antigen, a ligand, RNA, DNA, etc. In some 
cases the polymer portion of the targeted molecular bar code may serve as the second 
requisite moiety, i.e. the specific binding pair that binds with the analyte. 

The subject targeted molecular bar codes can be fabricated using any convenient 
protocol, where a variety of protocols are known to those of skill in the art. For example, the 
polymeric molecular bar code can be synthesized in whole or in parts, where the parts are 
subsequently joined together. The resultant polymeric molecular bar code can then be 
conjugated to the binding moiety to produce the targeted molecular bar code. The manner by 
which the targeted molecular bar code is produced is not critical to the invention. The 
particular method employed will depend, at least in part, on the nature of the targeted 
molecular bar code and its constituent parts. For example, for those components of the bar 
code that are nucleic acids, chemical synthesizers and/or recombinant techniques may be 
employed. For those components that are biologically inert polymers, e.g. polystyrenes, 
polyacrylates, etc., synthetic polymerization protocols can be employed, where of particular 
protocols of interest are living polymerizations that yield products of low polydispersity and 
controlled molecular weights, e.g. as reported inBenoit et al., J. Am. Chem. Soc. (1999) 121: 
3904. Representative procedures for producing targeted molecular bar codes is provided in 
the experimental section, infra. 

The subject targeted molecular bar codes find use in a variety of applications, 
particularly analyte detection applications in which a sample is assayed for the presence of an 
analyte(s) of interest. A wide variety of analytes may be detected using the subject targeted 
molecular bar codes, where such analytes include both naturally occurring and synthetic 
compounds, e.g. biological analytes, such as antibodies, receptors, ligands, peptides, proteins, 
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nucleic acids (DNA, RNA), viruses, bacteria, toxins, etc.; and environmental analytes, e.g. 
toxins, pollutants, etc.; and the like. 

The sample that is screened in the subject methods may be obtained from a variety of 
sources. Samples, as used herein, include biological fluids such as blood, cerebrospinal fluid, 
tears, saliva, lymph, dialysis fluid, semen and the like; organ or tissue culture derived fluids; 
food and fluids extracted from cells or physiological tissues, where the cells may be 
dissociated, in the case of solid tissues, or tissue sections may be analyzed. Alternatively a 
lysate of the cells may be prepared. Other samples of interest include environmental samples, 
such as plant tissue samples, ground water samples, soil samples, and derivatives of such 
samples. Because of the nanoscale dimensions of the poer detector, the sample that is 
screened may be as small as a single biological cell (typically of 10 jam diameter) or as small 
as a defined target within a biological cell such as organelles, vescicles etc. (typically of 1 ixm 
diameter or less). 

In performing the subject methods of analyte detection, the first step is to contact the 
sample suspected of comprising the analyte of interest with a targeted molecular bar code 
having a specific binding pair member specific for the analyte of interest. The term "contact" 
is used in the broadest sense to mean any type of combining action which brings the targeted 
molecular bar into sufficiently close proximity with the analyte of interest in the sample such 
that a binding interaction will occur if the analyte of interest specific for the specific binding 
pair member of the targeted bar code is present in the sample. Contacting can be achieved in 
a variety of different ways, including introducing the targeted molecular bar code into the 
sample, and the like. 

Following contact or introduction of the targeted molecular bar code into the sample, 
the resultant reaction mixture is incubated for a sufficient period time for any specific binding 
interactions, e.g. ligand-receptor binding, hybridization, etc., between the specific binding 
pair member and analyte (if present) in the sample to occur. The particular incubation 
conditions will vary depending on the specific nature of the analyte of interest and the 
specific binding pair member, where such conditions can be readily determined by those of 
skill in the art. For example, where the specific binding pair member is an antibody and the 
analyte of interest is an antigen, the incubation solution is generally buffered in the range of 
about pH 6.5-9.5. The incubation time should be sufficient for the targeted molecular bar 
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code to bind available analyte molecules. Generally, from about 0.1 to 3 hr is sufficient, 
usually 1 hr sufficing. In an alternative example, where the analyte of interest is a particular 
DNA and the specific binding member is a DNA (or hybridizing mimetic thereof, e.g. PNA) 
having a sequence complementary to the sequence of the DNA of interest, conditions 
sufficient for hybridization are employed, where the conditions will generally be stringent 
conditions, for example, at 50°C or higher and O.lxSSC (15 mM sodium chloride/01.5 mM 
sodium citrate). Incubation results in the formation of specific binding complexes between 
the specific binding pair component of the targeted molecular bar code and the analyte(s) of 
interest, if present, in the sample being assayed. 

Following complex formation, any unbound targeted molecular bar code is separated 
from the analyte/targeted molecular bar code complexes, i.e. the bound targeted molecular 
bar codes. Any convenient separation protocol may be employed, where the particular 
separation protocol that is chosen in a given assay will depend primarily on the nature of the 
analyte. Thus, where the analyte is large, e.g. a cell, one means of separation includes 
centrifugation, with cell bound targeted molecular bar codes being separated from unbound 
targeted molecular bar codes that are free in solution. Alternatively, where the analyte of 
interest, if present, is fixed to a solid support, e.g. a polynucleotide blot on nitrocellulose 
(such as a Northern blot), the substrate may be separated from the sample and washed to 
remove unbound targeted molecular bar codes. In yet other embodiments, the separation 
protocol may involve size separation protocols, with the analyte bound targeted molecular bar 
codes being separated from the unbound targeted molecular bar codes by methods that 
discriminate and separate based on size, such as chromatography, gel electrophoresis etc. 

Following separation of the targeted molecular bar code/analyte complexes from the 
unbound or free targeted molecular bar codes, as well as any other sample constituents, the 
molecular bar code of the analyte/targeted molecular bar code complex is separated from the 
remainder of the complex, i.e. the specific binding pair member and the analyte. Separation 
generally is achieved through cleavage of the linker group. As such, separation typically 
involves treating the complex with an agent that cleaves the linking group. The agent that is 
chosen thus necessarily depends on the nature of the Unking group. For example, where the 
linking group is a photocleave-able group, the agent may be light of appropriate wavelength 
to disrupt or cleave the photocleave-able group. In many embodiments, the wavelength of 
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light used to cleave the photo-cleavable linking group ranges from about 180 nm to 400 nm, 
usually from about 250 nm to 400 nm and more usually from about 300 nm to 400nm. 
Alternatively, where the agent is a chemically cleave-able group, the agent will be a chemical 
agent capable of cleaving the group, .e.g. dithiothreitol or beta-mercaptoethanol where the 
linking group is a disulfide. In yet other embodiments where the linking group is a restriction 
site, the agent is a catalytic agent, e.g. a restriction enzyme. Importantly, this cleavage step 
should not involve the use of agents which disrupt the nature of the molecular bar code, e.g. 
that hydrolyze it. This cleavage step results in the production of a cleavage product mixture 
comprising free molecular bar code and analyte/specific binding member complex. 

Following the production of the cleavage product mixture, the free molecular bar 
codes present therein, if any, are detected and related to the presence of the analyte(s) of 
interest in the sample. Detection may or may not involve separation of the free molecular bar 
code from the remainder of the cleavage product mixture, as may be convenient or desired. 
The free molecular bar code is detected using any convenient protocol. Thus, in an 
embodiment where the bar code is capable of participating in Watson-Crick base pair 
interactions, the bar code may be detected with a complementary labeled nucleic acid, e.g. 
radioactive or fluorescently labeled KNA, where this detection step may or may not involve 
an amplification step, e.g. a PCR step. 

A particularly preferred means of detecting the molecular bar code is to translocate 
the molecular bar code through a nanopore under the influence of an applied electric field and 
observe the effect over time of the translocation on a measurable signal. One such 
measurable signal is blockade of ion current through a nanopore which generates a unique 
current blockade profile for each molecular bar code. In other words, the molecular bar code 
is "scanned" by translocating it through a nanopore, where the output of the scan is the 
current blockade profile. Once the current blockade profile is obtained, it can be compared to 
reference current blockade profiles to determine the exact identity of the specific binding 
member to which the molecular bar code was originally bound in the initial targeted 
molecular bar code. This comparing step and identification step can be done manually but is 
ideally performed by an appropriate computer hardware/software system. The above process 
is summarized in Figure 4. 
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The nanopore device that is employed in the above preferred detection method is 
typically a device that comprises a nanopore inserted into a thin film with means for applying 
an electric field across the nanopore and for measuring the resultant signal at the nanopore. 
See Figure 2. By nanopore is meant a structure having a channel or pore with a diameter of 
"nano" dimensions, where the inner diameter of the pore or channel typically ranges from 
about lto 10, usually from about 1 to 5 and more usually from about 1 to 2 nm. The nanopore 
may be synthetic or naturally occurring, where naturally occurring nanopores include 
oligomeric protein channels, such as porins, gramicidins, and synthetic peptides and the like, 
where a particularly preferred protein channel is the self-assembled heptameric channel of a - 
hemolysin. In one embodiment, the thin film into which the nanopore is inserted is a lipid 
bilayer fabricated from a wide variety of one or more different lipids, where suitable lipids 
include: phosphatidlycholine, phosphatidylserine, phosphatidylethanolamine, glycerol 
mono-oleate, and cholesterol. 

A variety of suitable thin film support devices have been reported in the literature that 
may be used to support the nanopore used to detect the molecular bar code. Such devices 
include those described in: Brutyan et al., Biochimica et Biophysica Acta (1995) 1236:339- 
344; Wonderlin et al., Biophys. J. (1990) 58:289-297; Suarez-Isla et al. Biochemistry (1983) 
22:2319-2323 as well as those disclosed and reviewed in U.S. Patent Application Serial No. 
08/405,735 entitled "Characterization of Individual Polymer Molecules Based on Monomer- 
Interface Interactions" filed on March 1 7, 1995 and having a University of California 
reference number of 91-287.-2. Of particular interest is the device described in co-pending 
U.S. Patent Application No. 60/107,307 filed November 6, 1998, the disclosure of which is 
herein incorporated by reference. 

In detecting or "scanning" the molecular bar code with a nanopore device, the first 
step is to place the molecular bar code to be detected, e.g. the released or free molecular bar 
code generated from the cleavage step described above, on the cis side of the nanopore. The 
free molecular bar code will generally be in an aqueous solution, e.g. a buffered solution, 
where the solution typically comprises one or more dissolved salts, such as potassium 
chloride and the like, and the pH ranges from about 6.0 to 9.0, and more usually from about 
7.0 to 8.5. The solution on the trans side of the nanopore may be the same or different from 
the solution on the cis side, but will also generally be an ionic buffered solution. After the 
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free molecular bar code is placed on the cis side of the pore, an electric field is applied across 
the pore, conveniently by electrodes positioned in the cis and trans side of the pore. The 
electric field that is applied is sufficient to translocate the free molecular bar codes through 
the nanopore, and may range from about 50,000 to 500,000 volts per cm, where the applied 
electric field will typically range from about 100,000 to 400,000 volts per cm and more 
usually from about 150,000 to 300,000 volts per cm. During translocation of the molecular 
bar code through the nanopore, the ion current through the pore is measured. Measurements 
of individual molecular bar codes will typically be made at least every 1 s, usually at least 
every 0. 1 s and more usually at least every 0.02 s using a single nanopore. The measured data 
values are then manipulated to produce a current blockade profile or similar output capable of 
being compared against reference outputs such that the identity of the molecular bar code can 
be determined. The presence of analyte (if any) in the sample is then related to the quantity of 
its specific molecular bar code detected that was detected by the bar code. 

Because of the specific nature of the subject analyte detection assays employing the 
subject targeted molecular bar codes, amplification of the analyte or label associated 
therewith need not be performed prior to detection. Thus, the subject methods may be used to 
detect analyte in a sample at very low levels, where the amount may be as low as 20 
picomole or lower but will at least be 1 zeptomole, and in certain embodiments at least 1 
attomole. In addition, the presence of the analyte can be detected rapidly, usually in less than 
about 2 hours, and more usually in less than about 1 hour, where the time required for analyte 
detection may be as short as 1 minute or shorter, but will be at least 5 minutes in many 
embodiments. 

The above analyte detection methods find use in a variety of different applications in 
which the detection of one or more analytes of interest is desired. Applications in which the 
subject methods find use include diagnostic and screening applications. Diagnostic 
applications include applications in which the detection of one or more specific analytes in a 
complex physiological mixture, such as the samples described above, is desired. In such 
applications, the presence of the analyte(s) of interest will generally be indicative of a 
particular disease or condition in the host from which the screened sample is derived. Thus, 
in diagnostic applications according to the subject invention, the presence of one or more 
analytes of interest is detected in a sample from the subject being diagnosed using the 
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methods described above and then related to the presence or absence of a disease or 
condition. 

In screening applications based on the subject methods, the subject targeted molecular 
bar codes are targeted, i.e. have a specific binding pair member that binds to an analyte of 
interest such as a toxin, bacterium, virion, pollutant and the like. As such, the screening 
applications can be used to screen physiological or environmental samples for the presence of 
man-made or naturally occurring toxins, pollutants, etc. 

Also provided by the subject invention are kits comprising the subject targeted 
molecular bar codes. The number of different types of targeted bar codes in the kit may range 
widely depending on the intended use of the kit, where the number may range from 1 to 
500,000 or higher, where any two targeted bar codes are considered to be of different type if 
their molecular bar code and specific binding pair member components are different. Thus, 
kits according to the subject invention may comprise a single type of targeted molecular bar 
code or 2 or more different types of targeted molecular bar codes, including sets of 4 different 
types of targeted molecular bar codes, sets of 10 different types of targeted molecular bar 
codes, sets of 64 different types of targeted molecular bar codes, sets of 100 different types of 
targeted molecular bar codes, sets of 1,000 different types of targeted molecular bar codes, 
sets of 100,000 different types of targeted molecular bar codes and sets of 500,000 types of 
targeted molecular bar codes. Thus, of interest are kits with at least 5, usually at least 10, 
more usually at least 15 different molecular bar codes, where the number of bar codes in 
these kits may be as high as 20, 100, 200, 500, 1000, 5000, 10,000, 20,0000 or more. In 
addition to the targeted molecular bar codes, the kits may further comprise one or more 
additional assay components, such as suitable buffer media, and the like. The kits may also 
include a device for detecting and scanning or reading the bar codes, such as those described 
supra, where the devices disclosed in copending U.S. Patent Application serial no. 
60/107,307 filed on November 6, 1998, are of particular interest. Finally, the kits may 
comprise instructions for using the targeted bar codes in methods of analyte detection 
according to the subject invention, where these instructions for use may be present on the kit 
packaging and/or on a package insert. 

The following examples are offered by way of illustration and not by way of 
limitation. 
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EXPERIMENTAL 

I. Preparation of a Horizontal Bilayer Containing a Single Nanopore Using a Miniature 
Horizontal Support 

A single channel was inserted into a bilayer on the horizontal aperture as follows: 

A. Formation of diphytanoyl PC/hexadecane bilayers on a horizontal aperture. 

A miniature support was manufactured as described under 'Preparation of Subject 
Devices' as disclosed in copending application serial no. 60/107,307 filed November 8, 1998, 
the disclosure of which is herein incorporated by reference. A lipid bilayer was then formed 
as follows. The aperture and the Teflon bath holding the aperture were first cleaned for 10 
min in boiling 5% nitric acid, then rinsed in nanopure water. Just before use, the aperture and 
bath were rinsed with ethanol followed by hexane, and then air dried. The aperture was then 
coated with a thin film of diphytanoyl PC (obtained from Avanti Polar Lipids, Birmingham, 
Alabama) by applying 5 ^1 of a 200 p,g per mL solution in spectroscopy grade hexane which 
was then evaporated with a light stream of air injected through the U-tube from the trans 
side. The chambers on both sides of the aperture were then filled with 65 \A of buffer 
composed of 1.0 M KC1, 5 mM HEPES/KOH at pH 7.5. Silver chloride electrodes 
manufactured using standard methods were placed directly into each bath and were attached 
to an Axopatch 200B amplifier. To paint a bilayer, a single one-centimeter-long bristle on a 
000 brush was dipped into a 3 mg per mL diphytanoyl PC solution in spectroscopy grade 
hexadecane. The bristle was then gently brushed across the aperture as viewed 4 by a standard 
dissecting microscope. A 5 mV, 60 cycle square wave was applied across the aperture as a 
seal test. Once a seal was achieved the aperture was brushed repeatedly with a clean bristle 
until a capacitance of about 0,6 \x¥ cm 2 was achieved. 

B. Insertion of individual a-hemolysin channels into the DiphytanoylPC/hexadecane 
bilayer. 



19 



WO 00/34527 PCTYUS99/30598 

ct-hemolysin lyophilized in phosphate buffer (Calbiochem, La Jolla, CA) was 
dissolved in nanopure water at 2 jag per \xl and dispensed as 2 \il aliquots into 0.2 mL 
polypropylene tubes. These aliquots were frozen at -20° C. On the day of an experiment, a 
single tube of toxin was placed on ice and diluted in 1.0 M KC1/HEPES buffer to a final 
concentration of 0.04 \xg per jxl. One j^lI of this diluted stock was added to the cis side of the 
bilayer and mixed gently. Voltage (120 mV trans positive) was then applied across the 
bilayer. A single channel typically inserted into the bilayer within 10-60 minutes as 
indicated by an abrupt increase in current. In the event that no channel insertion was observed 
in one hour, a second 0.04 \xg aliquot of toxin was added. This generally resulted in a channel 
within an additional 15 minutes. Upon channel insertion, the cis chamber was immediately 
perfused with 2 mL of buffer, i.e. about 30 times the bath volume. Single rectifying channels 
(ca. 120 pA current in 1M KC1 at 120 mV potential trans positive) were used immediately 
for molecular bar code analysis. 

II. Use of the Device 

The nanopore device described above was used to characterize molecular bar codes as 
follows: 

A. Preparation of Molecular Bar Codes 

1 . Preparation of RN A Homopolymer Molecular Bar Codes 

Homopolymers of polycytidylic acid and polyadenylic acid (2000+ nt) were 
purchased from Fluka (Ronkonkoma, NY). To generate shorter fragments, 5 mg of full 
length RNA homopolymers were weighed into a 12 mL polypropylene tube. To this was 
added 1 mL of alkaline buffer (pH 10.2, 40 mM NaHC03, 60 mM Na2C0 3 ) pre-warmed to 

60° C. For a product ranging in size from 100 to 500 nt in length, the solution was incubated 
at 60° C for 23.5 minutes and the reaction stopped by adding 100 ill of 3 M sodium acetate, 
pH 5.2, and 50 jal of 10% glacial acetic acid. The RNA was precipitated in 2.5 volumes of 
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ethanoi at -20<> C. The pellet was rinsed in 80% ethanol, then redissolved in 1 volume water 
and 1 volume 2X formamide loading buffer (90% formamide, 10% lOx MOPS RNA buffer). 
The product was loaded on an 8% polyacrylamide/MOPS gel and run at 4 volts per cm 
alongside RNA markers (Century Markers, Ambion Inc., Austin, TX). The gel was then 
examined by UV shadowing and RNA fragments of varying length were excised and eluted 
from the gel by electrophoresis. The sized RN A was then ethanol-precipitated and 
redissolved in water or pH 7. 5 TE buffer at 2-to-5 ug per ul. 

2. Synthesis of a Molecular Bar Code Comprised of an RNA Block Copolymer 

A 134 base DNA oligo-nucleotide composed of the sequence 
TAATACGACTCACTATAGGGA(A 29 )/C( 70 )GGTACCACACAC (SEQ ID NO:01) 
was purchased from Midland Certified Reagents (Midland, TX). Full-length 134 nt strands 
were separated from incomplete strands by electrophoresis on an 8% preparative PAGE/TBE 
gel at 100 V for 4 hours. The desired band was excised, the full length material was 
electroeluted from the gel slice, precipitated in ethanol, rinsed twice with 80% ethanol, air- 
dried, then dissolved in water to give a final concentration of 1 ug per ul. 

Double-stranded template was synthesized from the purified single-stranded 134mer 
using Sequenase (Amersham/U.S. Biochemical, Cleveland, OH). Briefly, 1 ugof the 
134mer (25 pmol final) were combined with 0.2 ug of a 14 base reverse complement to the 
3' end of the 134 mer (50 pmol final), 4 ul of Sequenase 5X buffer, and 3 ul nanopure water. 
This mixture was heated to 65° C for 2 minutes and gradually cooled to 4° over 30 minutes 
to permit annealing of the reverse complement to the 134 nt strand. This solution was then 
heated to 37° for two minutes and combined with 1 ul 0.1 M DTT, 2.4 ul of a 2.5 tmM dNTP 
mixture at room temperature, and 6 ul of pure water. This solution was brought to 37° C for 
1 minute, combined with 1 ul of 13U per ul Sequenase and then incubated at 37° C for 45 
minutes. The resulting double-stranded DNA product was stored at -20° C. 

RNA was synthesized using the 134nt double-stranded DNA template and a T7 RNA 
polymerase-based kit designed to give very high yields of short transcripts (Megashortscript, 
Ambion Inc., Austin, TX). Briefly, we combined, in order, at room temperature, 4 ul 
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nanopure water, 2 |ul 10X transcription buffer, 2 ^1 each of 75 mM ATP, CTP, UTP, GTP, 4 
|il of dsDNA template from the previous step, and 2 fil Megashortscript(T7 RNA 
polymerase) enzyme stock. This mixture was incubated at 37° C for 2 hours. At the end of 
the incubation, 1 \d of 2U/jiI DNAse 1 was added along with 0.25 |jig of RNAse Tl (Life 
Technologies) to cleave undesired ends of the RNA product at G residues. This digestion 
was incubated at 37° for 1 5 minutes. The product was then run on an 8% PAGE gel in 1 x 
MOPS RNA buffer at 80 V. The desired 101 nt band was excised and eluted by 
electrophoresis. The elution buffer was then exchanged for pH 7.5 TE buffer using a Bio-Rad 
30 spin column (Hercules, CA). The final product was stored at 2 \ig per |il in a -20° C 
freezer. 

3. Synthesis of Molecular Bar Codes Using a DNA Synthesizer 

Molecular bar codes were synthesized by conventional phosphoramidite chemistry 
using an automated device. The coupling and deprotection steps were in accordance with 
standard methods recommended by the manufacturer of the DNA synthesizer. The synthesis 
scale ranged from 40 nmol to 1 jamol, and the column support was typically dT-CPG with a 
pore size of 500 angstroms to 1000 angstroms. These syntheses were typically performed 
'trityl on 5 after the last coupling step, permitting purification of full length molecular bar 
codes using a widely practiced column chromatography procedure. An example molecular 
bar code synthesized using 69 coupling reactions at 0.2 umol scale was: 

PolydT (33 ) /PEG ( i8) /dT (I /PEG (18) /polydT ( 34) 
A current blockade profile for this molecular bar code is illustrated in Figure 8. 

In one embodiment, an automated DNA synthesizer is used to directly couple a bar 
code to a DNA oligomer that is a reverse complement to a targeted gene ( e.g. Xenopus 
elongation factor). A cleave-able site (the Xba I recognition sequence) is built into the 
synthetic strand between the bar code and the specific binding pair member using a hairpin 
and a C18 spacer. The complete targeted molecular bar code is then 
5nitroindole(20)/Spacer^ 

Spacer 1 8/GTGTCCAATGAC AACGATGTTGATGTGAGTCTTTTCCTTT 
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This molecule is applied to a membrane blot that is thought to contain Xef mRNA. 
Following equilibration, unbound targeted molecular bar code is washed from the surface. 
The bound material is then digested with Xba I which cuts the strand at the Xba I recognition 
sequence in the hairpin stem, thus releasing the 

5nitroindole(20)/Spacer9/dT(l)/Spacer9/dT(17) molecular bar code for analysis using a 
nanopore. 

B. Current Blockades Produced by Molecular Bar Codes 

1. Single channel current recordings. 

Current readings across single ot-hemolysin channels were acquired using an 
Axopatch 200B integrating patch clamp amplifier (Axon Instruments, Foster City, CA) in 
voltage clamp mode. Unless otherwise noted, data were acquired at 10 ills intervals in the 
whole cell configuration and were filtered at 10 kHz using a low-pass bessel filter. The 
analog signal was digitized using an Axon Instruments Digidata 1200 Series Interface, and 
then stored using pClamp 6.02 software (Axon Instruments, Foster City, CA). Before the 
addition of molecular bar codes to the cis chamber, data were acquired in gap free format for 
1 5 seconds each at 0, +120m V, and -120 mV. Molecular bar codes (1 0 -1 5 fig unless 
otherwise noted) were added to the cis chamber, and blockades of current were examined at 
120 mV (trans positive) for five minutes. Blockades were stored in pClamp 6.02 using the 
event driven format. 

2. Quantitative discrimination between RNA molecular bar codes using a nanopore 

Blockades of ionic current caused by occupancy of the a-hemolysin pore by the 
polyadenylic acid (polyA RNA) bar code were measured first. The polyA blockades (Fig. 7a) 
fell into three populations: i) relatively short (<200 p,s) blockades that reduced the current by 
40-60 pA; ii) blockades of indeterminate length that reduced the channel current by 65 pA to 
a residual current of about 50 pA (55% blockades); and iii) blockades of 1.5 to 2.5 ms that 
reduced the channel current by about 98 pA to 19 pA of residual current (84% blockades). 
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The duration of the third class of blockades wa s strand-length dependent, whereas the 
duration of the first two classes was length independent (data not shown). The pattern of 
blockades caused by polyC bar codes was easily distinguishable from the pattern for polyA 
bar codes whether the polymers were examined separately (Figs 7a and 7b) or in combination 
(Fig 7c). That is, the channel current was reduced significantly more by the polyC RNA bar 
code (typically 95% blockades) than by the polyA RNA bar code (84% blockades) and the 
polyC blockades were shorter in duration, averaging 6 us per nucleotide compared to 16 pis 
per nucleotide for polyA. Also, polyC RNA rarely induced lower amplitude blockades or 
biphasic blockades that were very common with polyA RN A. This constitutes proof of 
principle that the nanopore can quantitatively discriminate between a mixture of molecular 
bar codes in an aqueous solution. 

3 Detection of blocks within individual RNA molecular bar codes 

The preceding experiment suggested that a transition from polyA to polyC blocks 
within individual bar codes should be detectable by the a-hemolysin pore. To test this 
prediction, in vitro transcription and ribonuclease Tl digestion was used to generate a 101- 
nucleotide-long RNA bar code of the nominal composition A(30)C(70)Gp (section IIA.2 
above). Typical biphasic blockades caused by occupancy of the channel by this bar code are 
shown in Figure 3. As predicted, one component of the blockade reduced the channel current 
by 95% (consistent with the polyC block of the bar code), and the other component reduced 
the current by 84 % (consistent with the polyA block of the bar code). This proves the 
principle that blocks can be read along single molecular bar codes using a nanopore. 

4. Establishment of a unique current blockade profile for a molecular bar code 
comprised of DNA and non-DNA blocks 

A molecular bar code of the composition: 

polydT ( 33) /PEG(i8) /dT (I) /PEG (I8) /polydT {34 ) 
was synthesized on an ABI Automated DNA synthesizer using conventional phosphoramidite 
chemistry as outlined in Section IT. A.3 above. This molecule was then examined using a 
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nanopore as described in Section tl.B. 1, revealing a unique current blockade profile (Figure 
8) that was readily distinguishable from the other molecular bar codes listed in this 
application 

5 III. Detection of N-ras in a Complex Sample 

A. Synthesis of an N-ras targeted molecular bar code and its use for specifically 
detecting a segment of N-ras Exon 1 . 



10 A targeted molecular bar code was synthesized at 1 umol scale on an ABI 392 DNA 

Synthesizer using standard conditions. The targeted bar code (top of Figure 9) was composed 
of a poly dT I8 mer at its 5' end, linked by a disulfide bond (Glen Research, Sterling, VA) to 
a 50 base-long, anti-sense segment of N-ras Exon 1. This targeted bar code (10 uM final 
concentration) was hybridized at 40 degrees C for four hours in 6x SSC to either a 40 base 

15 segment of N-ras Exon 1 (sense strand) or to a 70 base segment of the p53 open reading 
frame. Each of the target oligonucleotides was attached to polystyrene beds to facilitate 
recovery and washing of the bound bar code. Following hybridization, the beads were 
washed twice in 0.2 x SSC at room temperature. The beads were then suspended in 20 ul of 
0. 1M DTT (pH 8.3/Tris) for 30 minutes to cleave the disulfide linkage between the dTig code 

20 and the N-ras targeting sequence. The beads were subsequently spun down and the dT, 8 bar 
code was recovered in the supernatant. One half the supernatant was then added to ca. 100 ul 
of 1M KC1 buffer bathing a single alpha hemolysin nanopore. The recovered bar codes were 
then counted by translocating them through the pore at 150 mV applied potential and 
identifying dT 18 -specific channel blockades. A diagram of the above protocol is provided in 

25 Fig. 9. 



B N-ras specific bar codes detected by the alpha hemolysin nanopore. 

Fig. 10 provides the results following detection of the bar codes obtained from A 
30 above. Panel A of Fig. 10 shows the unimpeded current through a single alpha hemolysin 
channel at 150 mV in 1M KC1 (HEPES/pH 8.0). Panel B shows the current through the same 
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channel following addition of bar codes that had been cleaved from the N-ras antisense strand 
hybridized to N-ras target beads. Panel C depicts an expanded view of a blockade pattern 
specific to the dT ]8 code cleaved from the N-ras targeting sequence. Only blockade events 
with this specific pattern were counted as positive results. These results are presented in the 
following Table 1. 



Table 1. 

Quantitative measure of N-ras targeted bar codes hybridized to a gene specific target and to a 

negative control. 



Bar Code Targeting 
Sequence 


Target Sequence 


N-ras Specific 

Bar Code Signatures 

(Counts per Assay ± S.E.) 


N-ras oo-sense 
50mer 


N-ras sense oligo 
Attached to polystyrene 
bead 


50 + 14 


N-ras a-sense 
50raer 


p53 sense oligo 
Attached to polystyrene 
Bead 


<1 



The above results demonstrate that a molecular bar code can be specifically targeted 
to a macromolecule of interest, then cleaved from the targeting moiety at a linker, and finally 
quantified using a nanopore detector. 



IV. Synthesis of Molecular Bar Codes with Hairpin Structures 
A. Bar Code Synthesis. 

DNA oligonucleotide bar codes were synthesized by conventional phosphoramidite 
chemistry using an ABI 392 DNA Synthesizer. Each molecule was composed of 20 dC 
monomers at the 5' end, followed by a hairpin with a 4 dT turn and a 0-10 bp stem, and then 
by 20 additional dC monomers at the 3' end. As controls, oligonucleotides with the identical 
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chemical compositions but scrambled sequences were also synthesized. These 
oligonucleotides were purified by PAGE and stored in TE buffer at -20 degrees centigrade. 

B. Characterization 

The effect of hairpin structure on oligonucleotide transport through a model nanopore 
was tested by adding purified preparations to a 70 ul bath containing 1 M KC1 buffered at 
pH 8.0 (HEPES/KOH) and at room temperature. 80 to 200 mV potential was applied across a 
diphytanoyl phosphatidylcholine bilayer containing a single ct-hemolysin nanopore. This 
applied voltage permits capture and translocation of the anionic DNA strands by the 
nanopore which can be read as an abrupt decrease in monovalent ion current. The duration of 
the current blockade is an accurate measure of the time required for an individual strand to 
translocate across the nanopore. Typical blockades are shown in Figure 11. 

Figure 12 shows that including hairpins with 2-to-10 base pair stems in DNA 
oligomers increases the blockade duration by 1 5-1 8 us per base pair relative to randomized 
controls. A two-base pair difference is easily distinguishable. Furthermore, oligomers 
containing different number of base pairs but the same number of monomer units can be 
distinguished based on blockade duration alone given a sample size of one zeptomole (1000 
molecules) (Figure 13). Thus, introduction of a hairpin into a synthetic DNA strand provides 
an unambiguous means to distinguish between two 'bar code' populations at high sensitivity 
based on residence time in the nanopore. 

The base pairing structure also acts as a built-in pause to provide a better signal-to- 
noise ratio for distinct portions of an encoded polymer. For example, a section of abasic units 
in a molecular bar code passing through a nanopore should give a transient increase in current 
because the smaller size of the units will block the current less. But, the speed at which these 
highly charged units pass through the channel makes them unobservable. To pause the 
molecular bar code and make the observation possible, a 6 base-pair stem was inserted into 
an oligonucleotide before and after a 20 monomer-long abasic segment (Figure 14). This 
resulted in an easily observable current transition during occupancy of the pore by the abasic 
segment of the encoded polymer (Figure 15). Specifically, when this molecule was 
translocated through the a-hemolysin nanopore under a 120 mV applied potential, a three 
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part blockade signature was clearly evident in which the first blockade component was 
characteristic of poly dC (14 pA or 89% current reduction relative to the open channel), the 
second component was characteristic of a deoxyribose-phosphate polymer (43 pA or a 66% 
current reduction) and the last component was characteristic of random DNA sequence (13 
pA or a 90% current reduction) (Figure lb). This experiment demonstrates that a synthetic 
polymer can be read by a nanopore, and that multiple component signals are detectable. 

The above results demonstrate that inclusion of hairpin loop structures into molecular 
bar codes modulates the characteristics of the bar codes. These built-i n secondary structures 
allow better control over the rate of polymer translocation through the nanopore. That is, 
encoded polymers made from extended, single-stranded DNA traverse the a-hemolysin 
nanopore at a rate of 1 - 3 ns/nucleotide. The above demonstrates that including hairpin 
structures with 2-to-10 base pair long stems increases the traversal time by 1 5-18 [is per base 
pair. This temporal control is important for two reasons: 1) slowing traversal permits a longer 
read time for each coding segment in the pore, which results in a much better signal to noise 
ratio and higher fidelity; and 2) building hairpins into the polymers allows one to use the 
traversal time as part of the code. For example, we have shown that oligomers containing 
different numbers of base pairs but the same number of nucleotides can be distinguished from 
one-another with a sample size of 1000 molecules (one zeptomole). The above described 
secondary structures may be incorporated into any type of bar code to provide for modulated 
properties, where such bar codes also include those described in V supra, e.g. e.g. polyacrylic 
acid or polystyrene containing bar codes. In addition, non-DNA secondary structures could 
be built into the bar codes and to regulate translocation in like manner. 

V. Preparation of Synthetic Polymers for Use in Molecular Bar Codes via Nitroxide- 
Mediated Radical Polymerization 

Nitroxide-mediated radical polymerizations were used to prepare synthetic polymers, 
where the protocols employed are based on the 'Braslau/Vladimir Initiator.' See the relevant 
literature section, supra. This technique is used to effect "living" polymerizations with 
extremely low polydispersities and controlled molecular weights for a number of monomer 
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families, including styrenes, acrylates, acrylic acids, and acrylonitriles (Figure 16). Using 
the Braslau/Vladimir initiator, controlled polymers (including block copolymers) bearing a 
variety of functionalities, including carboxylic acids, epoxides, amines, perfluoroalkyl 
groups, ethers, esters, amides, nitriles and substituted aromatic groups, are prepared. In 
5 addition, polymers with more complex topologies, such as comb, star and dendritic 

structures, are prepared. As such, this method enables the production of a large number of 
unique bar codes that can be utilized in targeted molecular bar code structures. In other 
words, these preparation methods permit synthesis of molecular bar codes with a large 
selection of interchangeable polymer blocks. Furthermore, molecular bar codes with 

10 backbones inert in biological media can be synthesized! 

A representative protocol is as follows. Polymer bar codes are prepared using 
nitroxide-mediated living free radical polymerization with an alphabet of commercially 
available monomers. Both acrylates and styrenes have been shown to function well using the 
Braslau/Vladimir initiator. Initial bar codes are designed using these monomer classes as 

15 substrates. Water solubility is designed into the polymers by the use of hydrophilic acrylate 
or methacrylate derivatives to form neutral hydrophilic backbones (Figure 17). Charged 
monomers are mixed into these backbones as either bulk random copolymers, gradient 
copolymers (achieved by syringe pump addition) or block copolymers (Figure 18). A few 
examples of each monomer class are shown here, all of which are commercially available 

20 (Figure 19). 

Polymerizations are typically carried out at 120 °C with 0.05 equivalents of the 
Braslau/Vladimir initiator and 50 to 200 equivalents of monomer. We have previously 
demonstrated random copolymerization using «-butyl acrylate and a variety of fiinctionalized 
acrylate monomers (200 equivalents): good polydispersities (<1.2) are typically obtained with 

25 ratios of up to 80/20 /-butyl acrylate/copolymer. Vinyl phosphonates are also of interest. 
Block copolymers starting with acrylates followed by styrene have given good results using 
our initiator. Acrylate/acrylate block copolymers are of interest as well, 

Derivatizable monomers can also be introduced into the polymer bar codes (Figure 
20). These can be protected forms of ionizable species. For example, /-butyl acrylate can be 

30 used as a protected form of acrylic acid: treatment of the polymer with TFA should liberate 
the carboxylic acids. Vinyl phosphonate esters can serve as a protected form of phosphonic 
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acids. Alternatively, these derivatizable monomers can be used as handles for the 
introduction of additional charged or fluorescent moieties; these can be added after the 
polymerization has been effected. In addition, the use of monomers with reactive 
functionality offers the opportunity to introduce graphed arms onto the polymer chain. This 
can be used to manipulate the effective diameter of the polymer chain, or to design more 
complex structures such as branched or comb polymers (Figure 21). The introduction of 
lipophilic and/or sterically bulky residues into the bar codes (pre or post-polymerization) 
provides further modulation of the bar code profde. 

After polymerization, the bar codes can be designed to be attached to the targeting 
peptides at either the beginning or terminal end. Hawker's route (Figure 22) can be used to 
prepare an initiator with a pendant primary hydroxy group, or reductive cleavage of the 
"living end" of the polymer with Zn/HOAc, Na/Napth or Sml2 can be used to leave a 
secondary hydroxy group on the bar code terminus (Figure 23). 

The above results demonstrate that a large number of diverse polymers that can be 
used as targeted molecular bar codes may be synthesized using nitroxide mediated 
polymerization protocols. 

VI. Preparation of Labile Linkers 

Photolabile Linker : 

Photolabile linkers based on vanillin are synthesized as follows. Treatment of vanillin 
1 with K 2 C0 3 , provides alkylation of the phenol with 6-Bromohexanoic acid, yielding 2. 2 is 
then nitrated with fuming HNO3 and acetic acid to yield 3. 3 may be protected at the 
carboxylic acid with TMS depending on the stability of the final product. This step may or 
may not be necessary. 4 will be reduced to 5 with NaBHU. At which point 5 will be 
phosphitilated to the corresponding phosphoramidite 6. 6 Can be used in a synthesizer to 
couple the linker to an oligo, or in solution phase coupling. Below, a proposed synthesis is 
presented in scheme 1. 
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(^^d^roplyamlno^^ynoolhylCI-phosphorimldlto Scheme 1 

Standard peptide coupling using BOP reagent to activate the carboxyl group, will 
allow the peptide to be attached under mild conditions. To cleave and release the bar code at 
the appropriate time, UV light at 365-400 nm will be illuminated on the compound. The 
reaction scheme is presented below in scheme 2 

Scheme 2 




BOP,DIEA,DMF 




The proposed mechanism for photolysis of o-nitrobenzyl carbonate ester is present in the 
literature, but no specific mechanism for the photolysis of the corresponding phosphodiester 
is presented. Therefore, in scheme 3, the mechanism for a carbonate ester is shown. The 
mechanism for the phosphodiester should resemble the carbonate. 
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Scheme 3 




Dimethoxytrityl (DMT) is a common protecting group for the use of solid phase 
5 synthesis of DNA and RNA. Dimethoxytrityl is generally used to selectively protect primary 
alcohols. It is easily cleaved under mildly acidic conditions, but stable to basic work-up. A 
dimethoxytrityl group is incorporated into the cleavable linker for conjugation of an oligo to a 
protein as follows. The linker has a carboxylic acid junctional group side chain to couple 
with an amine functionality 10. Scheme 4 describes the synthesis of this linker. 
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6-Bromohexanoic acid may be protected, where desired, prior to coupling with the 
phenol. If it is protected with TMS, it yields 7 the silyl ester of 6-bromohexanoic acid. 7 
then alkylates p-hydroxy methylbenzoate to yield 8. 7 behaves like the methyl ester during 
Grignard reaction. If alkylation at the silyl ester occurs, it is necessary to reverse the addition 
order and form the tertiary alcohol before alkylating with the 6-Bromohexanoic acid. The 
next step involves a double Grignard reaction at the ester , yielding alcohol 9. If the TMS 
protection of the caboxylate is not necessary, three equivalents of the Grignard will be 
necessary. The tertiary alcohol is then substituted with CI using acetyl-Cl, converting the 
compound to the linker/protecting group 10. This halide is then coupled to a primary alcohol 
using the same procedure as DMT protection. Once coupled to an oligo, the DMT linker is 
then attached to an amine functionality using standard peptide coupling techniques. 
Treatment with trichloroacetic acid is the standard deprotection method for DMT. The linker 
is more labile than the DMT because it has three electron donating groups vs. the two on 
DMT, so a milder acid treatment cleaves it readily. The general scheme is listed below. 



MSG' 




CI 



HO— f BarCode 



Pyridine 




OMe 



TMSO' 



s PROTEIN 



1. TBAF 

2. BOP.DIEA.DMF 




PROTEIN 




PROTEIN 



OH 



HO- j BarCode 
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It is evident from the above results and discussion that the subject targeted molecular 
bar codes constitute a significant advance in the field of analyte detection. The subject bar 
codes can be read directly without amplification and labeling steps. Furthermore, the bar 
codes can be read rapidly in much shorter times than that required for methods in which other 
labels, such as fluorescent or radioactive labels, are employed. In addition, the labels can be 
read from a heterogeneous population and need not be isolated before being read. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were ' 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
those of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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WHAT IS CLAIMED IS: 



1. A targeted molecular bar code comprising: 

(a) a molecular bar code, wherein said molecular bar code is a charged polymer 
5 capable of generating a reproducible signal upon passage through a nanopore; and 

(b) a member of a specific binding pair, wherein said specific binding pair 
member is joined directly or through a linking group to said molecular bar code. 



2. The targeted molecular bar code of Claim 1, wherein said charged polymer is 
10 negatively charged. 

3. The targeted molecular bar code according to Claim 2, wherein said negatively 
charged polymer is made up of monomeric units that comprise a moiety selected from the 
group consisting of a phosphate group or a phosphorothioate group. 

15 

4. The targeted molecular bar code according to Claim 2, wherein said polymer is a 
block copolymer of a plurality of blocks, wherein said plurality of blocks are selected from 
two or more different blocks. 

20 5. The targeted molecular bar code according to Claim 4, wherein said block 
copolymer comprises three different blocks. 

6. The targeted molecular bar code according to Claim 1, wherein said molecular bar 
code comprises a linking group. 

25 

7. The targeted molecular bar code according to Claim 6, wherein said linking group is 
a photocleavable linking group. 
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8. A targeted molecular bar code comprising: 

(a) a negatively charged block copolymer of from one to twenty blocks, wherein 
said blocks are selected from two or more different blocks, wherein each block consists of 
monomeric units comprising a phosphate group; and 

(b) a member of a specific binding pair, wherein said member of a specific 
binding pair is joined to said negatively charged block copolymer through a linking group. 

9. The targeted molecular bar code according to Claim 8, wherein said blocks are 
selected from two to four different blocks. 

10. The targeted molecular bar code according to Claim 8, wherein each block is a 
homopolymer of monomeric units selected from the group consisting of phosphates and 
sugar phosphates. 

1 1 . The targeted molecular bar code according to Claim 8, wherein said sugar 
phosphates are selected from the group consisting of ribose phosphates and deoxyribose 
phosphates. 

12. The targeted molecular bar code according to Claim 11, wherein said sugar 
phosphates may optionally comprise a heterocyclic nitrogenous base. 

13. The targeted molecular bar code according to Claim 12, wherein said heterocyclic 
nitrogenous base is a purine or a pyrimidine. 

14. The targeted molecular bar code according to Claim 8, wherein the length of each 
block of said block copolymer ranges from 15 to 25 nm. 

15. The targeted molecular bar code according to Claim 14, wherein said linker is a 
photocleavable linker. 
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16. A targeted molecular bar code comprising: 

(a) a negatively charged block copolymer of from two to twenty blocks, wherein 
said blocks are selected from a group of three different blocks, wherein each block is a 
homopolymer of monomeric units selected from the group consisting of polyphosphates, 
oligonucleotides, oligodeoxyribosephosphates; and polyethylene glycol-phosphodiesters; 

(b) a member of a specific binding pair, wherein said member of a specific 
binding pair is joined to said negatively charged block copolymer through a linking group. 

17. The targeted molecular bar code according to Claim 16, wherein said three different 
blocks are: polyethylene glycol-phosphodiesters; oligodeoxyribosephosphates; and 
oligonucleotides modified to prevent Watson-Crick base pairing. 

18. The targeted molecular bar code according to Claim 16, wherein each of said blocks 
is about 20 nm long. 

19. The targeted molecular bar code according to Claim 16, wherein said linker is a 
photocleavable linker, 

20. A method for detecting the presence of an analyte in a sample, said method 
comprising: 

(a) contacting said sample with at least one targeted molecular bar code 
according to Claim 1 under conditions sufficient for said specific binding pair to bind to 
said analyte in said sample; 

(b) separating unbound targeted molecular bar code from analyte-bound targeted 
molecular bar code; 

(c) treating said analyte bound targeted molecular bar code in a manner 
sufficient to release said molecular bar code from said analyte-bound targeted molecular bar 
code and produce free molecular bar code; 

(e) detecting the presence of said free molecular bar code; and 
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(f) relating the presence of said free molecular bar code to the presence of said 
analyte in said sample. 

21. The method according to Claim 20, wherein said detecting step comprises 
translocating said free molecular bar code through a nanopore. 

22. The method according to Claim 21, wherein said detecting step further comprises 
observing the current blockade effect of said translocation on said nanopore. 

23. The method according to Claim 20, wherein said method comprises contacting a 
plurality of different targeted molecular bar codes with said sample. 

24. The method according to Claim 23, wherein said sample size does not exceed the 
size of a biological cell. 

25. A kit for use in the detection of an analyte in a sample, said kit comprising: 
a targeted molecular bar code according to Claim 1. 

26. The kit according to Claim 25, wherein said kit comprises a plurality of different 
targeted molecular bar codes. 

27. The kit according to Claim 25, wherein said kit comprises at least 4 different types 
of targeted molecular bar codes. 

28. The kit according to Claim 25, wherein said kit comprises at least 64 different types 
of targeted molecular bar codes. 

29. The kit according to Claim 25, wherein said kit comprises at least 100 different 
types of targeted molecular bar codes. 
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FIGURE 3 
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FIGURE 5 
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FIGURE 10 
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FIGURE 12 
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FIGURE 14 
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FIGURE 21 
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